123 research outputs found

    Comparative phylogenomic analyses of teleost fish Hox gene clusters: lessons from the cichlid fish Astatotilapia burtoni

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Teleost fish have seven paralogous clusters of Hox genes stemming from two complete genome duplications early in vertebrate evolution, and an additional genome duplication during the evolution of ray-finned fish, followed by the secondary loss of one cluster. Gene duplications on the one hand, and the evolution of regulatory sequences on the other, are thought to be among the most important mechanisms for the evolution of new gene functions. Cichlid fish, the largest family of vertebrates with about 2500 species, are famous examples of speciation and morphological diversity. Since this diversity could be based on regulatory changes, we chose to study the coding as well as putative regulatory regions of their Hox clusters within a comparative genomic framework.</p> <p>Results</p> <p>We sequenced and characterized all seven Hox clusters of <it>Astatotilapia burtoni</it>, a haplochromine cichlid fish. Comparative analyses with data from other teleost fish such as zebrafish, two species of pufferfish, stickleback and medaka were performed. We traced losses of genes and microRNAs of Hox clusters, the medaka lineage seems to have lost more microRNAs than the other fish lineages. We found that each teleost genome studied so far has a unique set of Hox genes. The <it>hoxb7a </it>gene was lost independently several times during teleost evolution, the most recent event being within the radiation of East African cichlid fish. The conserved non-coding sequences (CNS) encompass a surprisingly large part of the clusters, especially in the HoxAa, HoxCa, and HoxDa clusters. Across all clusters, we observe a trend towards an increased content of CNS towards the anterior end.</p> <p>Conclusion</p> <p>The gene content of Hox clusters in teleost fishes is more variable than expected, with each species studied so far having a different set. Although the highest loss rate of Hox genes occurred immediately after whole genome duplications, our analyses showed that gene loss continued and is still ongoing in all teleost lineages. Along with the gene content, the CNS content also varies across clusters. The excess of CNS at the anterior end of clusters could imply a stronger conservation of anterior expression patters than those towards more posterior areas of the embryo.</p

    The genetic basis of energy conservation in the sulfate-reducing bacterium Desulfovibrio alaskensis G20.

    Get PDF
    Sulfate-reducing bacteria play major roles in the global carbon and sulfur cycles, but it remains unclear how reducing sulfate yields energy. To determine the genetic basis of energy conservation, we measured the fitness of thousands of pooled mutants of Desulfovibrio alaskensis G20 during growth in 12 different combinations of electron donors and acceptors. We show that ion pumping by the ferredoxin:NADH oxidoreductase Rnf is required whenever substrate-level phosphorylation is not possible. The uncharacterized complex Hdr/flox-1 (Dde_1207:13) is sometimes important alongside Rnf and may perform an electron bifurcation to generate more reduced ferredoxin from NADH to allow further ion pumping. Similarly, during the oxidation of malate or fumarate, the electron-bifurcating transhydrogenase NfnAB-2 (Dde_1250:1) is important and may generate reduced ferredoxin to allow additional ion pumping by Rnf. During formate oxidation, the periplasmic [NiFeSe] hydrogenase HysAB is required, which suggests that hydrogen forms in the periplasm, diffuses to the cytoplasm, and is used to reduce ferredoxin, thus providing a substrate for Rnf. During hydrogen utilization, the transmembrane electron transport complex Tmc is important and may move electrons from the periplasm into the cytoplasmic sulfite reduction pathway. Finally, mutants of many other putative electron carriers have no clear phenotype, which suggests that they are not important under our growth conditions, although we cannot rule out genetic redundancy

    The complete plastid genome sequence of Welwitschia mirabilis: an unusually compact plastome with accelerated divergence rates

    Get PDF
    Background Welwitschia mirabilis is the only extant member of the family Welwitschiaceae, one of three lineages of gnetophytes, an enigmatic group of gymnosperms variously allied with flowering plants or conifers. Limited sequence data and rapid divergence rates have precluded consensus on the evolutionary placement of gnetophytes based on molecular characters. Here we report on the first complete gnetophyte chloroplast genome sequence, from Welwitschia mirabilis, as well as analyses on divergence rates of protein-coding genes, comparisons of gene content and order, and phylogenetic implications. Results The chloroplast genome of Welwitschia mirabilis [GenBank: EU342371] is comprised of 119,726 base pairs and exhibits large and small single copy regions and two copies of the large inverted repeat (IR). Only 101 unique gene species are encoded. The Welwitschia plastome is the most compact photosynthetic land plant plastome sequenced to date; 66% of the sequence codes for product. The genome also exhibits a slightly expanded IR, a minimum of 9 inversions that modify gene order, and 19 genes that are lost or present as pseudogenes. Phylogenetic analyses, including one representative of each extant seed plant lineage and based on 57 concatenated protein-coding sequences, place Welwitschia at the base of all seed plants (distance, maximum parsimony) or as the sister to Pinus (the only conifer representative) in a monophyletic gymnosperm clade (maximum likelihood, bayesian). Relative rate tests on these gene sequences show the Welwitschia sequences to be evolving at faster rates than other seed plants. For these genes individually, a comparison of average pairwise distances indicates that relative divergence in Welwitschia ranges from amounts about equal to other seed plants to amounts almost three times greater than the average for non-gnetophyte seed plants. Conclusion Although the basic organization of the Welwitschia plastome is typical, its compactness, gene content and high nucleotide divergence rates are atypical. The current lack of additional conifer plastome sequences precludes any discrimination between the gnetifer and gnepine hypotheses of seed plant relationships. However, both phylogenetic analyses and shared genome features identified here are consistent with either of the hypotheses that link gnetophytes with conifers, but are inconsistent with the anthophyte hypothesis

    The Enhancer of split complex arose prior to the diversification of schizophoran flies and is strongly conserved between Drosophilaand stalk-eyed flies (Diopsidae)

    Get PDF
    In Drosophila, the Enhancer of split complex (E(spl)-C) comprises 11 bHLH and Bearded genes that function during Notch signaling to repress proneural identity in the developing peripheral nervous system. Comparison with other insects indicates that the basal state for Diptera is a single bHLH and Bearded homolog and that the expansion of the gene complex occurred in the lineage leading to Drosophila. However, comparative genomic data from other fly species that would elucidate the origin and sequence of gene duplication for the complex is lacking. Therefore, in order to examine the evolutionary history of the complex within Diptera, we reconstructed, using several fosmid clones, the entire E(spl)-complex in the stalk-eyed fly, Teleopsis dalmanni and collected additional homologs of E(spl)-C genes from searches of dipteran EST databases and the Glossina morsitans genome assembly. Comparison of the Teleopsis E(spl)-C gene organization with Drosophila indicates complete conservation in gene number and orientation between the species except that T. dalmanni contains a duplicated copy of E(spl)m5 that is not present in Drosophila. Phylogenetic analysis of E(spl)-complex bHLH and Bearded genes for several dipteran species clearly demonstrates that all members of the complex were present prior to the diversification of schizophoran flies. Comparison of upstream regulatory elements and 3' UTR domains between the species also reveals strong conservation for many of the genes and identifies several novel characteristics of E(spl)-C regulatory evolution including the discovery of a previously unidentified, highly conserved SPS+A domain between E(spl)mΞ³ and E(spl)mΞ². Identifying the phylogenetic origin of E(spl)-C genes and their associated regulatory DNA is essential to understanding the functional significance of this well-studied gene complex. Results from this study provide numerous insights into the evolutionary history of the complex and will help refine the focus of studies examining the adaptive consequences of this gene expansion.https://doi.org/10.1186/1471-2148-11-35

    Parallel Loss of Plastid Introns and Their Maturase in the Genus Cuscuta

    Get PDF
    Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta

    Functional genomics with a comprehensive library of transposon mutants for the sulfate-reducing bacterium Desulfovibrio alaskensis G20.

    Get PDF
    UnlabelledThe genomes of sulfate-reducing bacteria remain poorly characterized, largely due to a paucity of experimental data and genetic tools. To meet this challenge, we generated an archived library of 15,477 mapped transposon insertion mutants in the sulfate-reducing bacterium Desulfovibrio alaskensis G20. To demonstrate the utility of the individual mutants, we profiled gene expression in mutants of six regulatory genes and used these data, together with 1,313 high-confidence transcription start sites identified by tiling microarrays and transcriptome sequencing (5' RNA-Seq), to update the regulons of Fur and Rex and to confirm the predicted regulons of LysX, PhnF, PerR, and Dde_3000, a histidine kinase. In addition to enabling single mutant investigations, the D.&nbsp;alaskensis G20 transposon mutants also contain DNA bar codes, which enables the pooling and analysis of mutant fitness for thousands of strains simultaneously. Using two pools of mutants that represent insertions in 2,369 unique protein-coding genes, we demonstrate that the hypothetical gene Dde_3007 is required for methionine biosynthesis. Using comparative genomics, we propose that Dde_3007 performs a missing step in methionine biosynthesis by transferring a sulfur group to O-phosphohomoserine to form homocysteine. Additionally, we show that the entire choline utilization cluster is important for fitness in choline sulfate medium, which confirms that a functional microcompartment is required for choline oxidation. Finally, we demonstrate that Dde_3291, a MerR-like transcription factor, is a choline-dependent activator of the choline utilization cluster. Taken together, our data set and genetic resources provide a foundation for systems-level investigation of a poorly studied group of bacteria of environmental and industrial importance.ImportanceSulfate-reducing bacteria contribute to global nutrient cycles and are a nuisance for the petroleum industry. Despite their environmental and industrial significance, the genomes of sulfate-reducing bacteria remain poorly characterized. Here, we describe a genetic approach to fill gaps in our knowledge of sulfate-reducing bacteria. We generated a large collection of archived, transposon mutants in Desulfovibrio alaskensis G20 and used the phenotypes of these mutant strains to infer the function of genes involved in gene regulation, methionine biosynthesis, and choline utilization. Our findings and mutant resources will enable systematic investigations into gene function, energy generation, stress response, and metabolism for this important group of bacteria

    Complete plastid genome sequences of Drimys, Liriodendron, and Piper: implications for the phylogenetic relationships of magnoliids

    Get PDF
    BACKGROUND: The magnoliids with four orders, 19 families, and 8,500 species represent one of the largest clades of early diverging angiosperms. Although several recent angiosperm phylogenetic analyses supported the monophyly of magnoliids and suggested relationships among the orders, the limited number of genes examined resulted in only weak support, and these issues remain controversial. Furthermore, considerable incongruence resulted in phylogenetic reconstructions supporting three different sets of relationships among magnoliids and the two large angiosperm clades, monocots and eudicots. We sequenced the plastid genomes of three magnoliids, Drimys (Canellales), Liriodendron (Magnoliales), and Piper (Piperales), and used these data in combination with 32 other angiosperm plastid genomes to assess phylogenetic relationships among magnoliids and to examine patterns of variation of GC content. RESULTS: The Drimys, Liriodendron, and Piper plastid genomes are very similar in size at 160,604, 159,886 bp, and 160,624 bp, respectively. Gene content and order are nearly identical to many other unrearranged angiosperm plastid genomes, including Calycanthus, the other published magnoliid genome. Overall GC content ranges from 34–39%, and coding regions have a substantially higher GC content than non-coding regions. Among protein-coding genes, GC content varies by codon position with 1st codon > 2nd codon > 3rd codon, and it varies by functional group with photosynthetic genes having the highest percentage and NADH genes the lowest. Phylogenetic analyses using parsimony and likelihood methods and sequences of 61 protein-coding genes provided strong support for the monophyly of magnoliids and two strongly supported groups were identified, the Canellales/Piperales and the Laurales/Magnoliales. Strong support is reported for monocots and eudicots as sister clades with magnoliids diverging before the monocot-eudicot split. The trees also provided moderate or strong support for the position of Amborella as sister to a clade including all other angiosperms. CONCLUSION: Evolutionary comparisons of three new magnoliid plastid genome sequences, combined with other published angiosperm genomes, confirm that GC content is unevenly distributed across the genome by location, codon position, and functional group. Furthermore, phylogenetic analyses provide the strongest support so far for the hypothesis that the magnoliids are sister to a large clade that includes both monocots and eudicots
    • …
    corecore